69 



IN THE SPECIFICATION: 

(1) Please replace the paragraph that begins at 
line 19 of page 5 and ends at line 5 of page 6 with the 
following replacement paragraph: 



Much of the coding sequence of the human 
genome is not homologous to known genes, making 
^ I detection of open reading frames ("ORFs") and 

predictions of gene function difficult. 
Computational methods exist for predicting coding 
regions in eukaryotic genomes. Gene prediction 
programs such as GRAIL and GRAIL II, Uberbacher et 
O al.r Proc. Natl. Acad. Sci. (7SA 88 (24) : 11251-5 

M (1991); Xu et al., Genet. Eng. 16:241-53 (1994); 

Uberbacher et al., Methods Enzymol. 265:259-81 
f|j (1996); GENEFINDER, Solovyev et al., Nucl . Acids. 

H Res. 22:5156-63 (1994); Solovyev et al., Ismb 

=1 5:294-302 (1997); and GENSCAN, Burge et al., J. 

if™ 

^ Mol. Biol. 268:78-94 (1997), predict many putative 



genes without known homology or function. Such 
programs are known, however, to give high false 
positive rates. Burset et al.. Genomics 34:353-367 
(1996) . Using a consensus obtained by a plurality 
of such programs is known to increase the 
reliability of calling exons from genomic sequence. 
Ansari-Lari et al., Genome Res. 8(l):29-40 (1998). 



(2) Please replace the paragraph that begins at 
line 30 of page 6 and ends at line 9 of page 7 with the 
following replacement paragraph: 



It is common for microarrays to be 
derived from cDNA/EST libraries, either from those 
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previously described in the literature, such as 
those from the I.M.A.G.E. consortium, Lennon et 
ai., "The I.M.A.G.E. Consortium: an Integrated 
Molecular Analysis of Genomes and Their Expression, 
Genomics 33(1): 151-2 (1996), or from the 
construction of "problem specific" libraries 
targeted at a particular biological question, R.S. 
Thomas et al., Toxicologist 54:68-69 (2000). Such 
microarrays by definition can measure expression 
only of those genes found in EST libraries, and 
thus have not been useful as probes for genes 
discovered solely by genomic sequencing. 



(3) Please replace the paragraph that begins at 
line 33 of page 23 and ends at line 8 of page 25 with the 
following replacement paragraph: 



As discussed below, and further described 
in detail in commonly owned and copending U.S. 
provisional application nos. 60/207,456, filed May 
26, 2000; 60/234,687, filed September 21, 2000; 
60/236,359, filed September 27, 2000; in commonly 
owned and copending U.K. patent application no. 
0024263.6, filed October 4, 2000; and in commonly 
owned and copending PCT applications 
PCT/USOl/00666; PCT/USOl/00 667 ; PCT/USOl/00664; 
PCT/USOl/00669; PCT/USOl/00665; PCT/USOl/00668; 
PCT/USOl/00663; PCT/USOl/00662; PCT/USOl/00661 ; and 
PCT/USOl/00670, the disclosures of which are 
incorporated herein by reference in their entirety, 
we have used the methods and apparatus of the 
present invention to identify more than 15,000 
exons in human genomic sequence whose expression we 
have confirmed in at least one human tissue or cell 



type. Fully two-thirds of the exons belong to 
genes that were not at the time of our discovery 
represented in existing public expression (EST, 
cDNA) databases, making the methods and apparatus 
of the present invention extremely powerful tools 
for novel gene discovery. 
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(4) Please replace the paragraph that begins at 
line 28 of page 76 and ends at line 3 of page 77 with the 
following replacement paragraph: 

One third of the probe sequences (as 
amplified) produced an exact match (BLAST Expect 

("£") values less than le-100 (1 x 10"^°°)) to 
either an EST (20% of sequences) or a known mRNA 

(13% of sequences) . A further 22% of the probe 
sequences showed some homology to a known EST or 
mRNA (BLAST E values from le-5 (1 x 10"^) to le-99 

(1 X 10"^^) ) . The remaining 45% of the probe 
sequences showed no significant sequence homology 



Q to any expressed, or potentially expressed, 



sequences present in public databases. 



(5) Please replace the paragraph that begins at 
line 26 of page 80 and ends at line 10 of page 81 with the 
following replacement paragraph: 

FIG. 7A is a matrix presenting the 
expression of all verified sequences that showed 
signal intensity greater than 3 in at least one 
tissue. Each clone is represented by a column in 
the matrix. Each of the 10 tissues assayed is 
represented by a separate row in the matrix, and 
relative expression (expression ratio) of a clone 
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in that tissue is indicated at the respective node 
by intensity of green shading, with the intensity 
legend shown in panel B. The top row of the matrix 
("EST Hit") contains "bioinf ormatic" rather than 
"physical" expression data - that is, presents the 
results returned by query of EST, NR and SwissProt 
databases using the probe sequence. The legend for 
"bioinformatic expression" (i.e., degree of 
homology returned) is presented in panel C. 
Briefly, white is known, black is novel, with gray 
depicting nonidentical with significant homology 
(white: E values < le-100 (1 x 10"^°°); gray: E 
values from le-5 (1 x 10"^) to le-99 (1 x 10"^^); 
black: E values > le-5 (1 x 10"^)). 



(6) Please replace the paragraph at lines 7-16 
of page 82 with the following replacement paragraph: 



FIG. 8 shows in dashed line the 
normalized Cy3 signal intensity for all sequence- 
verified products with a BLAST Expect ("E") value 
of greater than le-30 (1 x 10"^°) (designated 
"unknown") upon query of existing EST, NR and 
SwissProt databases, and shows in solid line the 
normalized Cy3 signal intensity for all sequence- 
verified products with a BLAST Expect value of less 
than le-30 (1 x lO'^O) ("known"). Note that 
biological background noise has an averaged 
normalized Cy3 signal intensity of 0.2. 



(7) Please replace the paragraph that begins at 
line 18 of page 96 and ends at line 22 of page 97 with the 
following replacement paragraph: 



Using this threshold, we identified over 
15,000 single exon probes that produce significant 
signal in one or more of ten tested tissues/cell 
types. The exact structures of these single exon 
probes are clearly presented in the SEQUENCE 
LISTINGS included in commonly owned and copending 
U.S. provisional application nos. 60/207,456, filed 
May 26, 2000; 60/234,687, filed September 21, 2000; 
60/236,359, filed September 27, 2000; in commonly 
owned and copending U.K. patent application no. 
0024263.6,. filed October 4, 2000; and in commonly 
owned and copending PCT applications 
PCT/USOl/00666; PCT/USOl/00667 ; PCT/USOl/00664 ; 
0 PCT/USOl/00669; PCT/USOl/00665; PCT/USOl/00668 ; 

S PCT/USOl/00663; PCT/USOl/00662 ; PCT/USOl/00661 ; and 

PCT/USOl/00670, the disclosures of which are 



j^j incorporated herein by reference in their 

0 entireties. 

JjJ . — — 
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0 (8) Please replace the paragraph that begins at 

^ line 12 of page 98 and ends at line 13 of page 99 with the 

iH following replacement paragraph: 




The exact structures of these single exon 
probes are clearly presented in the SEQUENCE 
LISTINGS included in commonly owned and copending 
U.S. provisional application nos. 60/207,456, filed 
May 26, 2000; 60/234,687, filed September 21, 2000; 
60/236,359, filed September 27, 2000; in commonly 
owned and copending U.K. patent application no. 
0024263.6, filed October 4, 2000; and in commonly 
owned and copending PCT applications 
PCT/USOl/00666; PCT/USOl/00667; PCT/USOl/00664; 
PCT/USOl/00669; PCT/USOl/00665; PCT/USOl/00668; 
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